CxS Data Quality Calculus : Assessing Data Quality in Heterogeneous Databases
نویسنده
چکیده
Organizations in various research, military, commercial, and governmental institutions have realized the importance of systems integration. As information systems are integrated and information highways established, quality of data flowing through these highways to data consumers becomes increasingly critical. A key, and largely unexplored, challenge lies in how to evaluate data quality. This paper attempts to ascertain, and represent, knowledge that allows assessing data quality with respect to a particular quality dimension. In addition, algorithms for computing such quality dimension values are investigated. In this research, for any quality dimension, its value is defined as a net effect of quality parameters that affect it. This research investigates the notion of a data quality calculus to combine effects of affecting quality parameters to a whole effect. The primary constructs of the data quality calculus consist of quality join and qualityjoin expression relations. The data quality calculus provides a mechanism via which data consumers can explicitly specify, according to their needs, relationships between quality parameters. In addition, a methodology for computing the quality join based on relationships between quality parameters is an integral part of the calculus. The explicit representation of affecting quality parameters and their relationships to each other greatly simplifies the algorithmic component to provide a data consumer with data quality information which is specifically tuned to the customer's needs. *Research conducted herein has been supported in part by the National Heart, Lung, and Blood Institute under the grant number R01 HL33041, in part by the National Institutes of Health under the grant number R01 LM04493 from the National Library of Medicine, and in part by the International Financial Services Research Center at MIT.
منابع مشابه
Data Investigation: Issues of Data Quality and Implementing Base Analysis Technique to Evaluate Quality of Data in Heterogeneous Databases
Data investigation is a process to understand the nature of data in heterogeneous databases. Many organizations are using online transactions systems to support their company operations. The diversity of applications system that used to support organization may lead to data anomalies without the system owners realized the negative impact of decision making from insufficient information of data....
متن کاملStudy of Spatial Data Quality Elements and VGI Linear Data Quality Assessment Methods
Volunteered Geographic Information has provided a rich and valuable resource for spatial data in a variety of applications. Despite the many benefits, this information does not provide any guarantee for their quality. So far, there are several methods to determine the quality of VGI. In addition to introducing quality elements and their evaluation methods, the present study attempts to explore ...
متن کاملAssessing Temporal and Spatial Variations of Groundwater Quality (A case study: Kohpayeh-Segzi)
Assessing the quality of groundwater is important to ensure the sustainablesafe use of these resources. However, describing the overall water quality condition isdifficult due to the spatial variability of multiple contaminants and the wide range ofindicators (chemical, physical and biological) that could be measured. Therefore, in thiscase study, some water quality parameters including Na, Mg,...
متن کاملMedical and Surgical Treatment of Reproductive Outcomes in Polycystic Ovary Syndrome: An Overview of Systematic Reviews
Background Polycystic ovary syndrome (PCOS) is a common and complex condition affecting up to 18% of reproductive-aged women with reproductive, metabolic and psychological dysfunction. We performed an overview and appraisal of methodological quality of systematic reviews assessing medical and surgical treatments for reproductive outcomes in women with PCOS. Methods This was an overview of syste...
متن کاملنگاشت علّی مدیریت محصول آماری با رویکرد کیفیت داده
Attribute data quality is important object for each databases. If data quality doesn’t useful, Decisions of organizations will not effective. Quality assurance is the new problem in the quality management. This approach flow to data quality management. Maybe many data are collected but they aren’t have accuracy elemnt. The propose of this paper is recognition data quality concept and dimensions...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010